Iranian Journal of Numerical Analysis and Optimization 


Vol. 10, No. 1, (2020), pp 19-32 


ae 
DOI:10.22067 /ijnao.v10i1.75932 yu 


Research Article 


Axial preferred solutions for 
multiobjective optimal control 
problems: An application to chemical 
processes 


G.H. Askarirobati, A.H. Borzabadi* and A. Heydari 


Abstract 


Detecting the Pareto optimal solutions on the Pareto frontier is one of 
the most important topics in multiobjective optimal control problems. In 
real-world control systems, there is needed for the decision-maker to apply 
their own opinion to find the preferred solution from a large list of Pareto 
optimal solutions. This paper presents a class of axial preferred solutions for 
multiobjective optimal control problems in contexts in which partial informa- 
tion on preference weights of objectives is available. These solutions combine 
both the idea of improvement axis and Pareto optimality with respect to 
preference information. The axial preferred solution, in addition to taking 
considerations of decision-makers, provides continuous functions for control- 
ling chemical processes. Numerical results are presented for two problems of 
chemical processes with two different preferential situations. 
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1 Introduction 


Multiobjective optimal control problem (MOCP) is concerned with a method- 
ology that can treat complex problems encountered in (bio) chemical engi- 
neering, optimal robot paths in mechanical engineering, and optimal rocket 
trajectories in aerospace engineering, where optimal decisions need to be 
taken in the presence of trade-offs between several conflicting objectives; see 
[13, 16, 17]. 

In single-objective optimal control problem, the determination of the opti- 
mum solution among a set of given solutions is clear. However, in the absence 
of preference information, in multiobjective optimal control, there does not 
exist a way to determine if a solution is better than other, but, instead, we 
produce a set of them called the Pareto optimal set [8]. 

To solve multiobjective optimal control problems, multiobjective opti- 
mization methods are used as basic methods. Multiobjective optimization 
methods depending on how the decision-maker (DM) articulates their pref- 
erences divided into four classes: methods that involve a priori articulation 
of preferences, methods with a posteriori articulation of preferences, meth- 
ods that require no articulation of preferences are addressed, and interactive 
methods [6]. 

In a priori method, preference information is first asked from the DM 
before running the optimization algorithm, and then a solution best satisfying 
these preferences is found. In a posteriori method, a representative set of 
Pareto optimal solutions is first found, and then DM must choose a single 
solution from a set of solutions. In no preference methods, often the DM 
cannot concretely define what he or she prefers, but solutions are identified 
without preference information. In interactive methods, the DM is allowed 
to search for the most preferred solution iteratively. In each iteration of the 
interactive method, the DM shows Pareto optimal solutions and describes 
how the solutions can be improved [5]. 

Well-known examples of priori methods include the weighted sum method 
that consists of assigning each objective function a weight coefficient and then 
optimizing the function obtained by summing up all the objective functions 
scaled by their weight coefficients that only one solution can be rendered 
accordingly. With different weights, different Pareto optimal solutions are 
produced [6]. Though computationally more expensive, this approach gives 
an idea of the shape of the Pareto front and provides the user with more 
information about the trade-off among the objectives [22]. 

Usually it is not economical to generate the entire Pareto surface, due to 
the high computational cost for function evaluations. Moreover, it may be 
hard for the DM to choose from a large list. To mitigate these problems, 
with applying the DM’s preferences, it is possible to create situations in 
which realistic solutions can be obtained that are acceptable to the DM. Since 
incorporation of the preference information of the DMs and the elicitation 
of the weights of the objectives are essential problems in multiple objective 
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optimal control problems, in this paper, we introduce an approach that partly 
deals with these difficulties. We consider a situation that partial information 
on the objectives importance is available, that is, the values of these weights 
are not precisely stated. 


There are many methods in literature, in which objective function coeffi- 
cients or preferential weight coefficients are not exactly specified, but given as 
intervals or by means of linear relations (see, for instance, [19, 15, 7]). Since 
these methods focus on the sensitivity of a given solution to feasible changes 
in the parameters, in this paper, we introduce specific solutions, for multiple 
objective optimal control problems. These solutions, in addition to taking 
considerations of DMs, provide continuous functions for controlling chemical 
processes. When using axial preferred solutions, a weighted sum of the ob- 
jectives also directs the search for the results. We assume that preferences 
are originally additive, however, consider an additional element in the form 
of an improvement axis to better representation. Applying an improvement 
axis, the different objectives are not treated equally even when the preference 
information does not different between them. 

The main idea of this paper is to incorporate partial information of the 
DMs on the preference weights of objectives in order to achieve optimal so- 
lutions on the Pareto frontier that are acceptable and realistic for the DM 
according to their information considerations. In order to achieve this aim, 
we have to introduce a new concept of solutions, axial preferred solutions, 
for multiobjective optimal control. These solutions combine both the idea of 
improvement axis and Pareto optimality with respect to preference informa- 
tion. 


The rest of the paper is organized as follows. In Section 2, mathematical 
formulations of general multiobjective optimal control problem are briefly in- 
troduced. Section 3 is devoted to the representation of preference structures 
and the corresponding definition of efficiency. Also in this section, the idea 
of improvement axis is described. The axial preferred solutions for multiob- 
jective optimal control problem are introduced in Section 4. In Section 5, 
two problems are presented with two different preferential situations. Lastly, 
section 6 outlines the conclusion. 


2 Multiobjective optimal control problem 


A general multiobjective optimal control problem consists of optimizing a 
vector of functions is defined as below: 


Opt (J(x,u) = (Ji (2, u), Jo(x,u),..., Im(@,u))) 


subject to: 
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“(t) = f(z, u,t), 
g(x, u,t) > 0, 

W(xo, xf, to, tf) > 0, 
t € [to, ty]. 


where J; are functions of the state variable x : [to, tf] > R”, control variable 
u € L™, and time t. For each individual cost function, let us here consider 
the following formulation: 


ty 
J; = p(x, ty) +f L,(x, u, t)dt. 


to 


The objective functions are J; : R"+? x R? x [to,t¢] 4 R. The objective vector 
is subject to a set of dynamic constraints with f : R" x R? x [to,t,] > R”, 
algebraic constraints g : R” x RP x [to,t;]| > R°, and boundary conditions 
R?n+2 _5 RI, The admissible set P C R” x R? x [to, tf] is defined to be the 
set of all feasible pairs state and control (z,u) that satisfy in (2-5). 

In MOCPs, usually objectives conflict with each other, so it is not easy 
to have an admissible pair (2*, u*) that optimizes all the objectives simulta- 
neously. Therefore, the concept of Pareto optimality is used. The concept 
of optimality in a single objective is not directly applicable in multiobjective 
optimization problems. For this reason, a classification of the solutions is in- 
troduced in terms of Pareto optimality, according to the following definitions 
(see [23, 18]) in terms of minimization of objective functions: 


Definition 1. It is said that a pair (2*,u*) € P dominates another pair 
(x,u) € P (denoted by (a2*,u*) > (x, u)) if the pair (7*,u*) is no worse than 
the pair (x,u) in all objectives and the pair (x*,u*) is strictly better than 
(x, u) in at least one objective. If there is no solution that dominates (x*,u*), 
then it is nondominated. 


Definition 2. A pair (2*,u*) € P is a Pareto optimal solution of the MOCP 
if and only if there is no other pair (x,u) € P that dominates (x*,u*). 


Definition 3. The set of nondominated pairs (x*,u*) € P such that 
{(a*,u*) € P|-d(a,u) € P : (x,u) > (a#*,u*)} is said to be a Pareto set. 
Also, the set of vectors in the objective space that are images of a Pareto set, 
is said to be a Pareto frontier. 


3 Partial preference information and improvement axis 


If a unique vector of weights is available, then the preferred solution is ob- 
tained. When several decision makings have to solve a MOP, but do not 
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agree on the weight of objectives, each DM will propose a different vector 
of weights for objectives. In this case, we have a partial information on the 
importance of the objectives. The contribution of this paper is not the elic- 
itation of the weights, instead we want to obtain the whole set of weights 
considered appropriate by the DM. In fact, we consider a situation in which 
the weights are not exactly specified, but only partial information about im- 
portance of objectives is available. Here, the partial information is denoted 
by 2 C A™~1, where A"! = {we RT| 0", w” = 1}. In particular, we 
consider situations in which 2 is a polyhedron and whose extreme points are 
wi,w?,...,w®. 


Definition 4. Suppose that («*,u*),(a,u) € P. Then it is said that a 
vector J(x*,u*) dominates another pair J(x,u) with respect to Q C A™} 
(denote this relationship by J(a2*,u*) +q J(ax,u)) if w.J(a*,u*) < w.J (a, u) 
for all w € 2. 


Definition 5. Suppose that (x*,u*), (x, u) € P. Then it is said that a vector 
J(x*,u*) weakly dominates another pair J(x,u) with respect to Q C A™}, 
denoted by J(x*,u*) =o J(a,u), if w.J(x*,u*) < w.J (x, u) for all w € 0. 


Definition 6. The point J(x*,u*) € J(P) is said to be Q-nondominated, if 
there is no J(z,u) € J(P) such that J(x,u) >: J(a*,u*). A pair (2*,u*) € P 
is an Q-Pareto optimal if J(2*, u*) is Q-nondominated. 


If there is no information about the weight of the objectives, then all the 
weights will be possible and, as a result, 2-Pareto optimal reduces to Pareto- 
optimality. On the other hand, the highest level of information is available, 
when a unique vector of weight is provided. The more the partial information, 
the smaller the set of Q-Pareto optimal solutions. However, there may still 
be many Pareto optimal solutions, and more criteria are needed in order to 
identify the preferred solutions. For this purpose, we combine the idea of 0- 
Pareto optimality and improvement axis, which provides a direction in which 
the improvement of the objective value, and then introduce axial preferred 
solutions to finding most preferred solution. The existence of a direction to 
reach the Pareto frontier is the idea behind proportional solutions [9]. We 
define these solutions in the context of multiobjective optimal control prob- 
lems as follows. 


Definition 7. We say that a pair (z,u) € P is a proportional solu- 
tion for MOCP if there is a strictly positive constant p € R™ such that 
J(x*,u*) = h*p, where h* = min{h € Ry|A(z,u) € P, J(x,u) = hp} and is 
denoted by (a*,u*) = P(P, J). 


Proportional solutions are not necessarily Pareto optimal, and they not 
only do not incorporate the preference information, but may also produce 
results that are unacceptable to DMs (when their preferences incorporated). 
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For this purpose, here we introduce the solutions that incorporate preferential 
information and use the idea of improvement axis. 


4 Axial preferred solutions for multiobjective optimal 
control problems 


In this section, a multiobjective optimal control problem with partial infor- 
mation is denoted by (P, J,Q). To define the axial preferred solutions, we use 
both the idea of 0-Pareto optimality and improvement axis. If the partial 
information contains rules outside the set of the proportional results, then 
the axial preferred solution will consist of those results that the decision mak- 
ing considers equivalent to a point in the improvement axis with the highest 
possible level. 


Definition 1. We say that a pair (x, u) € P is an axial preferred solution for 
MOCP with partial information if there is an improvement axis p € R71’ such 
that J(a,u) =o h*p, where h* = min{h € R,|A(a,u) € P, I(x, u) =a hp} 
and is denoted by (x*,u*) = A(P, J,Q). 


Theorem 1. The axial preferred solutions are a subset of the 0-Pareto 
optimal solutions. 


Proof. Let (a,u) € A(P,J,Q), and suppose on the contrary that the pair 
(x, u) is not an Q-Pareto optimal solution, this means that J(x,u) is domi- 
nated with respect to 9. Then, there exists a pair (%,u) € P such that 


w".J(£,U) <w".J(x,u) <w'h*p for alr =1,...,m. 


Since p € R1, for each w € 2Q, we have w.p > O and there is small enough 
€ > 0, such that 


ew’ .p > w'(J(Z,u) — h*p), 
and so we have 


w(e+h*)p>w'J(z,u), for allr=1,...,m, 


and this contradicts to h* = min{h € R,|A(#,u) € P, J(x,u) =o hp}. 


The following theorem shows that axial preferred solutions are depend 
both on the extreme points of the set of information and on the axis. 


Theorem 2. Suppose that w1,w?,...,w’ are the extreme points of the 
partial information set, and that p € R’? is the improvement axis. Then 
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A(P,J,Q) =arg min Mo(J(2,u)), 


(x,u)EP 


where 


(1) 


Mo(J (a, u)) = max { wid (zu) w*.J (xu) w™. J(u) \ 


wip ? wp 7? w™p 
Proof. Given the definition of the preference relations, we have 
h* = min{h € R|A 
=min{h € R,|A 


=min{h € R,|A 
=min{h € R,|A 


EP, J(x,u) =q h.p} 

€P,w'.J(rz,u) <w".hp, r=1,...,m} 
EP, (w". J(ax,u))/(w".p) <h, r=1,...,m} 
€ P, Mo(J(a,u)) = h}. 


r,U 
r,U 


x, U 


SSN 
Ne NS ts 


x,U 


The following corollary states that results obtained with axial preferred 

solutions for a multiobjective optimal control problem can be computed by 
solving a scalar equivalent optimal control problem. 
Corollary 1. Suppose that w!,w?,...,w™ are the extreme points of the 
partial information set, and that p € RA’? is the improvement axis, Then 
(a*,u*) € A(P, J,Q), if and only if there exists h* such that (h*,x*,u*) is 
an optimal solution to the following problem: 


min h (2) 
s.t.: w".J(a,u) <hw".p, r=1,...,m, 
(z,u) © P. 


5 Numerical examples 


In this section, we illustrate the performance of the proposed method on two 
numerical example problems, one involving Fed-Batch bioreactor in Section 
5.1 and the other, catalyst mixing problem, in a tubular reactor in Section 5.2. 
It is a common practice to approximate the state and control variables over a 
partition of the time horizon in optimal control problems, a process referred 
to as discretization to obtain an approximate solution (see, for instance, [1, 
2, 3, 10, 11]). In this section, a discretized version of problems is solved by 
using the Ipopt [21]. We use the AMPL [4] as an optimization modeling 
language, which employs the Ipopt as a solver. 
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5.1 Fed-Batch Bioreactor 


The first MOCP is based on the fed-batch lysine fermentation process inves- 
tigated by Ohno, Nakanishi, and Takamatsu [17]. The aim is to determine 
an optimal feeding profile and batch length with respect to conflicting yield 
and productivity objectives as described by Logist et al. [13]. 


dx x 
a = (0.1252 ) Z1, 


v4 

diy 0.12529 

= 1 2.8u, 
dt Gy) ve 
dx ar \? x 
si [a (0.1252 ) +134 (01252) Z1, 
dt “A LA 
dx, 
——_ = Uu. 
dt 


with the time t[h] as the independent variable. The state variables are xj |g], 
the biomass, x2|g], the substrate, x3[9], the product (lysine), and x4[L], the 
fermenter volume. The control u[L/h] is the volumetric rate of the feed 
stream, and the initial conditions are specified as 


[x1(0), 22(0), x3(0), x4(0)] = [0.1, 14, 0, 5). 


The goal is to derive a feeding strategy and batch duration that maximize the 
productivity, that is, the ratio between the product formed and the process 
duration 


while maximizing the yield, that is, the mass of product is added over the 
mass of substrate during the operation 


v3(tf) 
2.8(xa(ty) — x4(0)) 


Jp = 


To cast these maximization problems into a minimization framework, the 
objective functions are defined as the negative productivity and yield that 
J, = —J, and Jp = —Jg. Note that clearly ty is free, but it is subject to 
the constraint 20 < ts < 40. Constraints are also imposed on the fermenter 
volume, the feed rate, and the amount of substrate to be added, respectively, 
as follows: 
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0< u(t) < 2, 
20 < 2.8(awa(ts) — 5) < 42. 


The objective functionals, constraints, and the process equations as given in 
[12] yield the following biobjective optimal control problem. 


ate xa(tf) r3(tp) 
( ty” 2.8 (@alt) — =m) ; 8) 
dx, 


t 0.125? 
S.t. 3 => é ——+ x 
dt LA m3 


dt Gy Pere 
[mousy + (ome) 
—— = — {884 | 0.125— |] +134 | 0.125— }} x, 
dt LA v4 

dx4 = 

dt 


[x1 (0), x2(0), r3(0), x4(0)] = (0.1, 14, 0, 5], 
20<tp< 40, 5 <-a4(t) < 20, 
O<u(t)<2, 20 <2.8(x4(ty) — 5) < 42. 


In this problem, suppose that the DM considers the ratio two to one be- 
tween the objectives Ji(#,u) and J2(x,u), respectively (p = (2,1)). Here, 
we consider two situations, in which the preferences information of DMs 
are different. In the first situation, the preferences of the DM are repre- 
sented by the set of information Q] = {w € A}|w, > we, 3w2 > wi}. 
This means that the importance of Ji(z,w) is no less than the importance 
of Jo(xz,u), and no more than three times the importance of Jo(z,u). The 
extreme points of 9; are (4,4) and (2,4). An alternative interpretation of 
this set of information in terms of group decision making is as follows: There 
are two DMs, one of them considers the first objective three times more 
important than the second objective, while the other one considers both ob- 
jectives equally important. In the second situation, the set of information 
is Q2 = {w € Allwo > wi, 2w, > we}, extreme points of Q2 are (3, 3) 
and (3, 2), and the interpretation is analogous. We have the optimal con- 
trol problems (2) and (3) by applying preferential weights and improving the 
axis. According to Corollary 1, for this problem with the first and second 
situations, respectively, we have 
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min h 


subject to the same constraints as in (1) and 


r3(tp) r3(tp) 
3 z3lts) x3(ty) ey, 


bf 2.8(x4(t) — x4(0)) 


min h 


subject to the same constraints as in (1) and 


r3(ts) r3(tf) 

i URGE) =a) 
x3(ty) 2x3 (ts) 

ty 2.8(ea(ty) - 240) = 


The optimal solution of problems (2) and (3) are (Jf, J) = (—17.148, —18.401) 
and (Jf, J}) = (—19.971, —17.062), respectively. The optimal control and the 
optimal state profiles are shown in Figures 1 and 2, respectively. When pro- 
ductivity is preferred, the initial max part is presented in order to stimulate 
the biomass growth and hence the lysine production. However, when the 
focus shifts towards yield optimization, more lysine can be produced with 
the same amount of substrate but at the expense of longer process times. 


Feeding rate (L/h) 


o 5 10 16 20 25 30 35 40 


Figure 1: Optimal profile obtained for the control with the first and second situations 
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Figure 2: Optimal profiles obtained for the states with the first and second situations 


5.2 Catalyst mixing problem in a tubular reactor 


The second MOCP considers a steady-state plug flow reactor of fixed length 
ty. The reactor is packed with two catalysts. These catalysts are required to 
stimulate a series of reactions (one reversible and one irreversible S,  S_ 3 


S3). A dynamic model reported by Logist et al. [14] is as follows: 


dx 

- = —u(x, — 1022), 

d 

a = u(x, — 10x%2) — (1— uu), 


where x; and 2x2 are the concentrations of S; and S> and u is the fraction 
of catalyst A. The original problem, which was introduced by Vassiliadis, 
Balsa-Canto, and Banga [20], considers the optimal mixing policy of the two 
catalysts in order to maximize the production at the reactor outlet 


J, =—Ja =(1—21(ts) — ra(tp)), 


where the reactor has a length ts equal to 1. To this end the optimal catalyst 
mixing profiles u(t) and (1—u(t)) along the reactor must be determined. The 
fraction of catalyst A is bounded: 0 < u(t) < 1, which directly also bounds 
the fraction of catalyst B: 0 < 1— u(t) < 1. To introduce a multiobjective 
nature, the minimization of the amount of the most expensive catalyst (i.e., 
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catalyst A) is added as an objective 


ot 
Jo = Jp = / u(t)dt, 
0 


for given initial conditions 
(0) = [1,0]*. 


In this problem, suppose that the DM considers the ratio three to one between 
the objectives Jj(x,u) and Jo(x,u), respectively (p = (3,1)). As in the 
previous example, we consider two situations. The preferences of the DM are 
represented by the set of information Q; = {w € A!|w) > we, 3we > wi} 
and Q2 = {w € Al|we > wi, 2wi > we}. By modeling and solving the 
problem, we obtain (Jj, Jj) = (—0.047, 0.216) and (Jy, Jz) = (—0.015, 0.033) 
for the first and second situations. 

Figure 3 shows the optimal profiles for mixing u, concentrations of x7; and x2 
with the first and second situations. When the focus is put on limiting the use 
of catalyst A, it is seen that the control consists of one minimum arc, meaning 
that only catalyst B will be used. However, when the production of species $3 
does play a role, the catalyst profiles exhibit a maximum-singular-minimum 
type arc structure. The more the focus shifts towards the production of $3, 
the larger the maximum and singular arc become and the higher the values 
are during the singular interval. 


catalyst A fraction u [-] 


fraction x1 [-] 


fraction x2 [-] 


Oo 0.1 o2 03 O4 O05 O06 OF O8 O89 1 
Spatial coordinate [-] 


Figure 3: Optimal profiles for mixing u (top), concentrations of x; (middle) and z2 
(bottom) with the first and second situations 
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6 Conclusion 


In this paper, we used both the idea of improvement axis and Pareto opti- 
mality with respect to preference information and introduced a class of axial 
preferred solutions for multiobjective optimal control problems in contexts in 
which partial information on preference weights of objectives is available. It 
was shown that the axial preferred solutions are a subset of the Pareto opti- 
mal solutions. Numerical results were presented for two problems of chemical 
processes with two different preferential situations. It has been seen that dif- 
ferent preferential situations lead to different optimal solutions. 
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